Extraction of Syntactic Structures Based on the Czech Parser Synt
نویسنده
چکیده
In this paper we describe the usage of the syntactic parser synt (developed in the NLP Centre at Masaryk University) to gain information about syntactic structures (such as noun or verb phrases) of common sentences in Czech. These structures are from the analysis point of view usually identical to nonterminals in the grammar used by the parser to find possible valid derivations of the given sentence. The parser has been extended in such a way that enables its highly ambiguous output to be used for extracting those syntactic structures unambiguously and gives several ways how to identify them. To achieve this, some previously unused results of syntactic analysis have been evolved leading to more precise morphological analysis and hence also deeper distinction among various syntactic (sub)structures. Finally, we present an application for shallow valency extraction.
منابع مشابه
Grammar Development for Czech Syntactic Parser with Corpus-based Techniques
In the paper, we present the description of the Czech syntactic parser synt developed at FI MU NLP laboratory. The presented system is based on the meta-grammar formalism with a head-driven chart parser. The parsing technique provides fast analysis of the context free backbone with successive evaluation of the contextual constraints using so called “forest of values.” The meta-grammar formalism...
متن کاملMeasuring Coverage of a Valency Lexicon using Full Syntactic Analysis
Recent development showed that valency information provides a great benefit in many areas of natural language processing. Building valency lexicons is however a complex and time-consuming task from both theoretical and practical points of view, since designing of the lexicon plays a crucial role in its future usability as well as its careful and considerated preparation. As for any manually cre...
متن کاملTest Suite for the Czech Parser Synt
This paper presents a set of tools designed for testing the Czech syntax parser that is being developed at the Natural Language Processing Centre at theMasaryk University, synt. Testing the parser against a newly created phrasal tree corpora is very important for future development of the parser and its grammar. The usage of the test suite is not restricted to the synt parser but is open to wid...
متن کاملDiscovering Grammatical Relations in Czech Sentences
The syntactic parser synt developed at NLP Centre, Faculty of Informatics, Masaryk University, can provide as one of its possible outputs a list of dependency relations discovered in the analysed sentence. In the paper, we present the result of codification and translation of the (rather technically labeled) dependency relations from synt to linguistically significant relations. The resulting r...
متن کاملAnalyzing Time-Related Clauses in Transparent Intensional Logic
The Normal Translation Algorithm (NTA) for Transparent Intensional Logic (TIL) describes the key parts of standard translation from natural language sentences to logical formulae. NTA was specified for the Czech language as a representative of a free-word-order language with rich morphological system. In this paper, we describe the implementation of the sentence building part of NTA within a mo...
متن کامل